Source Data Perturbation in Statistical Disclosure Control

نویسنده

  • Menno Cuppen
چکیده

When tables of quantitative data are generated from a datafile, the release of those tables should not reveal information concerning individual respondents. This disclosure of individual respondents in the microdata file can be prevented by applying disclosure control methods at the table level, but this may create inconsistencies across tables. Alternatively, disclosure control methods can be executed at the microdata level, but these methods change the data permanently and do not account for specific table properties. These problems can be circumvented by assigning a weight factor to each respondent in the microdata file. Upon tabulation, each contribution of a respondent is weighted multiplicatively by the respondent’s weight factor. This approach is called Source Data Perturbation (SDP) because the data is perturbed at the microdata level, not at the table level. It should be noted, however, that the original microdata is not changed. Moreover, the weight factors can be chosen such that the tables generated from the microdata are safe, and the information loss is minimized.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Working Paper No. 30 ENGLISH ONLY UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS EUROPEAN COMMISSION STATISTICAL OFFICE OF THE EUROPEAN COMMUNITIES (EUROSTAT)

In this paper we give an overview of various approaches to the implementation of statistical disclosure control to tabular data released through the Web. We consider three generic groups of statistical disclosure control methods: source data perturbation, output perturbation and query-set restriction. Considering different types of Web-sites and implementation approaches we discuss the appropri...

متن کامل

Measurement Error and Statistical Disclosure Control

Statistical agencies release microdata to researchers after applying statistical disclosure control (SDC) methods. Noise addition is a perturbative SDC method which is carried out by adding independent random noise to a continuous variable or by misclassifying values of a categorical variable according to a probability mechanism. Because these errors are purposely introduced into the data by th...

متن کامل

Statistical Disclosure Control for Data Privacy Preservation

With the phenomenal change in a way data are collected, stored and disseminated among various data analyst there is an urgent need of protecting the privacy of data. As when individual data get disseminated among various users, there is a high risk of revelation of sensitive data related to any individual, which may violate various legal and ethical issues. Statistical Disclosure Control (SDC) ...

متن کامل

Maximum Utility-Minimum Information Loss Table Server Design for Statistical Disclosure Control of Tabular Data

Statistical agencies typically serve a diverse group of end users with varying information needs. Accommodating the conflicting needs for information in combination with stringent rules for statistical disclosure limitation (SDL) of statistical information creates a special challenge. We provide a generic table server design for SDL of tabular data to meet this challenge. Our table server desig...

متن کامل

Polynomial-time Attack on Output Perturbation Sanitizers for Real-valued Databases

Output Perturbation is one of several strategies in the area of Statistical Disclosure Control (SDC), also known as Private Data Analysis. The general problem in SDC consists of releasing valuable information about individuals in a databasewhile preserving their privacy. Examples of this include databases containing health information about patients, customer electronic transactions, and web br...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000